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Abstract. The calculation of crystal structure from X-ray diffraction 
data requires that the phases of the "structure factors" (Fourier coeffi- 
cients) determined by scattering be deduced from the absolute values of 
those structure factors. Motivated by a question of Herbert Hauptman, 
we consider the problem of determining phases by direct algebraic means 
in the case of crystal structures with n equal atoms in the unit cell, with 
n small. We rephrase the problem as a question about multiplicative in- 
variants for a particular finite group action. We show that the absolute 
values form a generating set for the field of invariants of this action, and 
consider the problem of making this theorem constructive and practical; 
the most promising approach for deriving explicit formulas uses SAGBI 
bases. 



If the unit cell of a crystal has n atoms, located at positions fj, 1 < j < n, 
then the structure factor associated to a reciprocal lattice vector v is 



where the aj are the scattering amplitudes determined by the electron charge 
distribution in the j-th atom. This is, in effect, a Fourier transform coef- 
ficient, and the structure of the crystal can be determined from the E^^ by 
an inverse Fourier transform. However, in standard diffraction experiments, 
it is impossible to measure the E-^ - only their absolute values are observ- 
able. The "phase problem" of crystallography is to determine the phases 
of the -Ev given magnitudes \Ey^i\ for sufficiently many v'; this problem is 
fundamental in the subject, and has received considerable attention f jGiaj . 
|Ha2j ). The problem of retrieving phase information from absolute values, 
together with other physical constraints, occurs in several other areas of 
physics, astronomy, and engineering. 
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In the crystallographic context, the phases can be determined in principle 
only up to an additive constant, which is equivalent, via a Fourier transform, 
to the indeterminacy of the origin of the crystal. It is natural to consider 
"structure invariants," which are multiplicative combinations of structure 
factors that are invariant under change of origin, i.e., additive translation 
of the phase. In addition, structure invariants play an important role in 
commonly used stochastic methods for phase retrieval. The function 

(1) £'vi^v2 • • •-E'vm 

is easily seen to be a structure invariant when the reciprocal lattice vectors 
Vi sum to zero. The most common case is m = 3, i.e., the "triplet-structure 
invariant" 

E E E 

Exact formulas for phases of triplet structure invariants are known in 
terms of magnitudes for n = 1,2,3 |Halj . Herbert Hauptman asked one of 
us for a formula for arbitrary n, and the purpose of this paper is to show 
that, at least in the case in which all atoms in the crystal have the same aj, 
such a formula exists, and to explore techniques for finding such formulas 
explicitly. 

In this paper we shall only consider crystals with equal atoms (or equal 
polyatomic clumps); we will set the identical scattering factors aj equal to 1. 
In addition, it is convenient to assume that the space group is the most basic 
group PI (isomorphic to the group of translations); see |Hall Appendix 
1] for a description of how to generalize to arbitrary space groups. 

Under these assumptions the triplet phase determination problem can be 
converted to a question about multiplicative invariant functions. Somewhat 
to our surprise, this question seems to be new. 

To express the triplet phase problem to a problem in symmetric func- 
tions, start by noting that the phase (/>vi,v2 of the triplet structure factor 
£'vi-Ev2-E_vi-v2 satisfies 

-E'Vl -^V2 Vl— V2 — l-^Vll |-^V2l 1-^— V2— V2I 6Xp (i0vi , V2 ) • 

Thus the cosine of the phase can be expressed in terms of absolute values 
and the sum of the triplet-structure invariant and its complex conjugate. 
We want to express the phase 0vi,v2 terms only of absolute values 

IP |2 Tp 771* rp rp 

\-^avi+bv2\ civi+bv2 avi+bv2 -'-^avi+&V2^— avi— 6v2 

corresponding to reciprocal lattice vectors avi + 6v2, where a and b are 
integers. 

Fix vi and V2 and let 

Xj = exp(27rivi • r^), i/j = exp(27rzv2 • r^), 1 < j < n 

X = {xi,- ■ ■ ,Xn), Y = {yi,--- ,yn) . 
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Then 

n n 

^vi = ^ Xj and = E ■ 

Using the fact that z + z* = 2\z\cos{(p) if z is a complex number with 
absolute value \z\ and argument cp, we see that we need to express 

-VI -V2 + -£'-Vl-E'-V2-^Vl+V2 
= l-Evil |-E'v2l |-E'-V-V2l 2 COs((/>vi,V2) 

n n "l n ^ n ^ n 

j=i fc=i i=i j=i ^ k=i 1=1 

n n 
j,k,l=l j,k,l=l ^^'^ 

in terms of the magnitudes 

Qa,b{xi, . . . ,Xn,yi, . . . ,yn) ■= E^E^^ 

n n ^ a b 

= E ■ E "^i ^ " E ' 

i=i i=i i,j=i-^J^J 

for suitable integers a and b; here v = avi + 6v2 is an arbitrary reciprocal 
lattice vector. 

We will call ^^,6 an "observable" since it is (the square of) an absolute 
value and therefore it is possible to observe it physically. Thus our goal 
is to express E2 SI'S ci rational function of the observables Qab- From now 
on we will treat this as a question about variables Xi and yj, ignoring the 
fact that they are complex numbers of absolute value one. A simple Zariski 
density argument shows that this does not change the underlying problem, 
i.e., we are not going to "miss" any identities by assuming that Xi and yi are 
arbitrary complex numbers, rather than just those of absolute value one. 

Example 1.1. The reader can easily verify that for n = 2 the triplet phase 
invariant is a polynomial in three observables: 

E2{X, Y) = 2((7i,o + qo,i + qi,i) - 8. 

Example 1.2. The formula for n = 3 is considerably more elaborate, and 
was discovered by Hauptman |Halj : it takes the form E2{X,Y) = N/D, 
where 

Xi yi Xiyi \ 



D := qo,i + qi,o + qi,l - ^ = + YI " + + 
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and 

N := 135 - 311) + Z)2 + 2(^1,090,1 + ^1,091,1 + 9o,i9i,i) 
+ (90,lQ'2,l + qi,oqi,2 + qi-iqi,i) 

-5(gi,2 + 92,1 + 91, -l) - 2(90,2 + 92,2 + 92,0) • 

(For another formula, see Example 16.51 1 

Higher order generalizations of triplet structure invariants are defined as 
a product of structure factors where the v sum to 0. If a structure 
invariant is the product of m factors Q for arbitrary m, then this gives rise, 
in a similar manner, to the problem of expressing 

n 

. _ \ Xij . . . Xmj ^ljiX2j2 ■ ■ ■ ■^mj 

J Jli---ijm — J- 

as a rational function in observables 



(3) 9ri,...,r™ := Yl 



x\\ . . . xl" 



where ri , . . . , are integers. 

Note that the rational function / = E^ has the following invariance 
properties. 

a: / is of weight in each n- variable vector 

X\ — (xii , . . . , Xin) , . . . , X^ — (s^ml ) • • • ) XjYin) • 

That is, /(ciXi, . . . , CmXm) = fiXi, . . . , Xm) for any non-zero scalars 
ci, . . . ,Cm, or, more succinctly: f{cjXij) = f{xij). 
b: / is self-reciprocal in the sense that it remains unchanged if every 
variable Xij is simultaneously replaced by i.e., f{x^/) = f{xij). 
c: / is multi-symmetric in the sense that it remains unchanged if the 
variables in each array Xi are (simultaneously) permuted by the 
same permutation a G S^, i.e., f{xi„{^j-)) = f{xij). 
Our main results are summarized in the following theorem. Both parts 
answer questions posed by H. Hauptman. 

Theorem 1.3. (1) f Theorem \ 5. 1\ or Theorem \6.1\) Every rational function 
f{xij) in mn variables, satisfying the invariance properties a, b, c, can be 
expressed as a rational function of the observables qci,...,cm- -^'^ particular, 
Em can be expressed as a rational function of the observables qci,...,cm for 
any m > 1. 

(2) ( Proposition \7.1^ b)) Suppose n > 4. Then Em is not a polynomial in 
the observables qci,...,cm f^^ any m >2. 

Our proof of Theorem 1 1 . 31 r eg uir es the consideration of all / satisfying the 
invariance properties a, b, c, even if one is only interested in the structure 
invariants Em- This is true of most of our other proofs and algorithms. 
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The overall outline of the paper is as follows. In Section [3 we prove 
(a slightly more precise version of) Theorem I1.3r 2) for m = 1 using basic 
Galois theory and some combinatorial arguments; the proof is not obviously 
constructive. In Section|31we consider two possible approaches to making the 
argument constructive. In the subsequent section we give a fast algorithm 
for n < 4, based on SAGBI bases. In Section El we turn our attention 
to the multi-array case, i.e., m > 1. In Section El we reduce the problem of 
computing the invariant Em to that of expressing certain invariant functions 
in terms of observables in the single array case (m = 1). We then use the 
SAGBI basis algorithm of Section to obtain new expressions for E2 in the 
case where n = 3 and 4; see Examples 16.51 and 16.61 In Section [3 we prove 
Theorem I1.3r 2). Finally, in Sectional we study the structure of the field 
of rational functions / satisfying the invariance properties (a) - (c) as an 
abstract field, without reference to the observables. 

In the course of our work on this paper, we have encountered a phe- 
nomenon that often arises in the interstices between mathematics and its 
applications. Depending on the context, solving a mathematical problem 
can mean many different things, e.g., 

(a) proving a theorem, 

(b) giving a constructive proof, or 

(c) giving an algorithmic proof, suitable for practical computations. 

As one moves down this list, the problem can become more difficult, requir- 
ing different techniques and ways of thinking. However, there is usually a 
subtle but important interplay between these different modes of solution. 

Acknowledgments 

The authors would like to thank H. Hauptman for bringing this problem 
to their attention, M. Lorenz for contributing Proposition l8.ll and D. Eisen- 
bud, N. Elkies, J. Friedman, I. Laba, H. Lenstra, G. Martin, D. Peterson, 
C. Procesi, I. Swanson, and R. Thomas for helpful conversations. 

2. One set of variables 

Fix a field k of characteristic 0. Let X = (xi, . . . ,Xn) be an n-tuple of 
independent variables over k. We will operate on n-tuples as if they were 
diagonal matrices, so = {x^^ , . . . , x~^), ti{X) = xi + ■ ■ ■ + x„, etc. 

Let k{X)Q C k{xi,--- ,Xn) be the field of rational functions in the Xi 
of total degree 0; in other words, an element / G k{X)Q is a quotient of 
homogeneous polynomials of the same degree. Equivalently, k{X)Q is the 
field generated by the Xi/xj: 

k{X)o = k{xi/xj ■■ I < i,j < n,i ^ j) . 

We note that the field k{X)Q can also be viewed as the function field of the 
projective space P"~^. 
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The symmetric group S„ acts on k{X)o = k(xi,--- ,Xn)o by permuting 
the variables xi, • • • , in the natural way. In addition, we let r denote the 
automorphism that takes X to X~^, i.e., 

/ , 1 
r(xi) = — , 1 < i < n. 

This automorphism is obviously of order two, and we let T = {1, t} denote 
the corresponding group. The actions of S„ and T commute so that the 
group 

G := S„ X T 

acts on k{X)Q. This action is faithful for n > 3. If n = 2, G = S2 x T has 
order 4, and the kernel of its action on k{xi,X2)o is the subgroup of order 2 
generated by (o", r), where a is the nontrivial element of S2. 
The main theorem of this section is that the observables 

n , X r 

(4) := tr(X'-)tr(X-n = ^ p 

M=i ^^■'■^ 

generate the invariant field k{X^^ . (Note that here m = 1, so that the 
observables (jSJ have only one subscript.) 

Theorem 2.1. 

k{X)^ = k{qr 1 1 < r < n(n - l)/2) . 

Before proving the theorem, we prove two lemmas, one combinatorial and 
the other algebraic. 

Let n > 2 be an integer, N = n{n — 1), and A be the A^-element set 

(5) A = I i,j = l,...,n and i / j}. 

When convenient we will sometimes omit the comma and write (ij) instead 
of In addition, we tend to visualize the elements of A as the off- 

diagonal positions in an n by n matrix. 

There is a natural action of G = S„ x T on A, where r acts by transposition 

■r{i,j) = 

and the symmetric group S„ acts simultaneously on the rows and columns: 
(^ihj) = {o-(.i),(T{j)), cr G S„. 

This gives a map from G to the symmetric group = S„2_„ = Sym(A) of 
all permutations of A, and this map is an injection for n > 3, in which case 
we will usually just choose to regard G as a subgroup of S^y. In the case 
n = 2 the map is surjective with kernel of order two as described earlier. 

We will say that elements x = and x' = of A are opposite 

if X is the transpose of x' , i.e., i = j' and j = i' . If exactly one of these 
equalities holds, i.e., x' is not opposite to x, but it lies in the same row or 
column as the transpose of x, then we say that x and x' are adjacent. Note 
that opposite pairs are not also adjacent. 
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We will say that g & Sn preserves adjacency (respectively opposition) 
if for any adjacent (respectively, opposite) elements x, x' G A, the images 
g{x) and g{x') are also adjacent (respectively, opposite). Our combinatorial 
lemma says that for n > 2, G = S„ x T is precisely the subgroup of Sat that 
preserves both of these relations. 

We note that for n = 2 the situation is simple: G maps onto Sat = S2 
and the nontrivial element of Sat preserves both adjacency and opposition. 
So from now on we consider n > 3. 

Lemma 2.2. Let n > 3. Then h £ Sn preserves both adjacency and oppo- 
sition if and only i/ /i € G C Sat . 

Proof. It is immediate from the definition that every h in G preserves both 
adjacency and opposition, so we only need to prove that any element pre- 
serving these relations lies in G. 

Suppose h £ Sn preserves both adjacency and opposition. To show that 
/i € G, we will multiply h by elements of G until we arrive at the identity 
permutation of A. 

It is easy to see that S„ acts transitively on A. Thus, after composing h 
with an element of S„ C G, we may assume h{12) = (12). 

We claim that we may also assume that h{13) = (13). Indeed, since h 
preserves opposition, h{12) = (12) implies h{21) = (21). Suppose h{13) = 
(ij). Since (21) and (13) are adjacent, so are (21) and (ij), i.e., either 
i = 1 or j = 2. If i = 1 then j > 3; thus after replacing h by [3,j]/i, we 
obtain h{12) = (12) and /i(13) = (13), as desired. (Here [3,j] denotes the 
transposition in S^ that interchanges 3 and j.) On the other hand, if j = 2 
then i 1,2 and, after replacing h by [3, i][l, 2]r/i, we once again obtain 
h{l2) = (12) and h{l3) = (13). This proves the claim. 

Since h preserves opposition, h{13) = (13) implies that h{31) = (31). 
Now, since (23) is the unique pair adjacent to both (31) and (12), and /i(23) 
is the unique pair adjacent to both /i(31) = (31) and /i(12) = (12), we 
conclude that H{23) = (23). Since h preserves opposition, we also have 
/i(32) = (32). Thus h fixes [ij) for 1 < i, j < 3. This completes the proof of 
Lemma I2. 21 for n = 3; from now on we will assume that n > 4. 

Suppose h{li) = [ah) for some i > 4. Since (21) and [li) are adjacent, so 
are (21) and {ah). That is, either a = 1 or 6 = 2. Repeating this argument 
with (31) in place of (21), we see that either a = 1 or 6 = 3. Since b 
cannot be equal to both 2 and 3, we conclude that a = 1. In other words, 
h{li) = (l(T(i)), where o" is a permutation of 4, 5, ... ,n. After replacing h 
by (T~^h, we reduce to the case where h fixes (li) (and thus (il)), for every 
i = 2, . . . , n. 

We claim that h is the identity permutation, i.e., that h{ah) = {ah) for 
every (ah) £ A. Since we know this in the cases where a = 1 or 5 = 1, we 
may assume a,b > 2. In this case (ah) is the unique element of A that is 
adjacent to both (la) and (61). Hence, h{ab) = (ah), as claimed. □ 
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Next, we prove an algebraic lemma from which the theorem will follow 
easily. Let 

J = 1 

* 7^ j 

be the polynomial of degree = n(n — 1) whose roots are the Xi/xj, i ^ j. 
The coefficients Cj are the elementary symmetric functions in those roots, 
and since the reciprocals of roots are themselves roots, the polynomial / 
satisfies = /(t), which is equivalent to 

c-i = CN-i, < i < N. 

The elements Qr = tr(X'')tr(X^'") are symmetric functions of the Xi/xj 
are hence polynomials in the Cj; we let 

K ■.= k{qr\r = 1,2,- ■■) 

be the field generated by the observables Qr- 

As we will see, the proof of Theorem 12 . 1 1 basicallv comes down to deter- 
mining the Galois group of / over the field K. 

Lemma 2.3. With the above notation, 

(a) k[ci, . . . ,Cr] = k[qi, . . . q^] for any r, 1 < r < N . 
(h) K = k{ci,.. .,cn/2) = Hqi,-- ■,Qn/2)- 
(c) k{X)Q is the splitting field of f{t) over K. 

Proof. Since 

qr-n = tr(X'^)tr(X-^) -n = ^{xi/xj f 

is the sum of the m-th powers of the roots of /(t), part (a) follows from New- 
ton's formulas that express the symmetric polynomials ci, • • • ,Cr- in terms 
of the power sums gi, • • • , Qr- 

Part (b) follows from part (a) and the symmetry of the Cj. Part (c) follows 
from (b) and the fact that k{X)Q = k{xi/xj). □ 

We are now ready to finish the proof of Theorem l2.1l Clearly, K C k{X)Q . 
Consider the tower 

k{X)Q 
k(X)^ 
K 

of field extensions. By the lemma, k{X)Q is a Galois extension of K. Identify 
the set of roots of f{t) with the set A = {(i, j) | z 7^ j}, letting Xi/xj <-> (i, j)- 
The action of G = x T on the set of roots is the same as its action on A 
described earlier. 
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The notions of adjacency and opposition in A have a natural interpretation 
in this context. If x = (ab) and x' = (cd) are elements of A let r = Xa/xb and 
r' = Xc/xd be the corresponding roots of fit). Then x and x' are adjacent 
if and only if rr' is again a root of f{t) and opposite if and only if rr' = 1. 
Thus any Gal(/c(X)o /K) acts on the set of roots in a way that preserves both 
adjacency and opposition. Lemma l2!2l now tells us that Gal(A;(X)o/^) = G. 
Thus k{X)Q = K. This completes the proof of Theorem 12.11 

Remark 2.4. Since C G acts transitively on the roots of /(t), we con- 
clude that f{t) is irreducible over k{X)^ . 

3. Constructive proofs 

The proof in the last section ultimately relies on a fundamental and beau- 
tiful result in Galois theory: anything fixed by all elements of a Galois group 
lies in the ground field. We will now discuss a constructive proof, which could 
be viewed as the result of tracing through the argument of the previous sec- 
tion, rendering the underlying Galois theory explicit at each step. We note 
that both proofs rely on Lemma 12.21 

We begin by letting /c[X^^]o be the /c-algebra whose elements are /c-linear 
combinations of Laurent monomials a;°" . . . x^" of total degree 0, i.e., where 
the ai are integers whose sum is 0. Note that fc[X^-^]o is generated, as a 
/c-algebra, by elements of the form xi/xj] in particular, the field k{X)Q is 
the field of fractions of A;[Ar^^]o. Note also that A;[Ar^^]o is a G-invariant 
subring of k{X)Q. 

Let Zij be a set of = n(n — 1) algebraically independent variables over 
A;, where i and j are distinct integers between 1 and n. For notational 
convenience, we also set zu = 1 for every i = 1, . . . ,n. We now define a 
surjective A:-algebra homomorphism 

k[z,j\^k[X^% 

by (p{zij) = Xi/xj. Let Sr be the rth elementary symmetric polynomial in 
the A^ variables Zij and pr = ^Ij be the sum of the rth powers of these 
variables. Here sq = 1 and (l){sr) is the element of A;[X^^]o we called Cr in the 
statement of Lemma 12.31 We let S n denote the group of all permutations 
of {zij : i ^ j}, and identify G = S„ x T with the subgroup that acts on the 
Zij by a{zij) = for a G S„, and T{zij) = zji. 

Define three polynomials Di,D2,D E k[zij] of A^ variables by 

Dl{zij) = Y\ {ZabZcd - 1) , 

(ab) and (cd) G A 
are not opposite 

D2{Zij)= {ZabZcd - Zef) . 

(ab) and {cd) G A 
are not adjacent 

and D = DiD2- As we shall see below, (p{D) is a "universal denominator", 
such that if / € A;[X=ti]G then 0(D)/ is a polynomial in the observables. 
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Lemma 3.1. (a) Suppose g & G. Then (j){gD) = (j){D) ^ 
(b) Suppose g ^ G. Then (j){gD) = 0. 

Proof, (a) Clearly gDi = Di and gD2 = D2, because g preserves both 
adjacency and opposition. Thus gD = D, so that (j){gD) = (j){D). To show 
that (j){D) 7^ 0, note that if the image under of a factor of Di is zero then 

Xb Xd 

This implies that {ah) and (cd) are opposite elements of A, which is excluded 
by the definition of Di. This shows that (p{Di) ^ 0. Similarly, (j){zabZcd — 
Z(.f) = if and only if [ab) and {cd) are adjacent in A. Thus (j){D2) 7^ 0, and 
consequently, (j){D) ^ 0. 

(b) If (7 G S„ is not in G then by Lemma 12.21 at least one of the following 
holds: (i) g does not preserve opposition or (ii) g does not preserve adjacency. 

If (i) holds then g~^ does not preserve opposition either. In other words, 
there exists a pair of non-opposite elements (ab) and (cd) such that g{ab) 
and g{cd) are opposite, say, 61 = ci. Then ZabZcd — 1 is a factor of Di and 
4>{9{zabZcd - 1)) = 0. Hence, (j){g{Di)) = and thus (t){g{D)) = 0. 

Similarly, if (ii) holds then there exists a pair of non-adjacent elements 
[ab) and {cd) such that g{ab) = (oi^i) and g{cd) = (cidi) are adjacent, say, 
bi = ci. Now 

<P{g{ZabZcd - Zg-i(aidi))) = <P{ZaibiZc^di " ^^aidj) = = 

SO that 0(17(1)2)) = and thus (j){g{D)) = 4>{g{Di))(l){g{D2)) = 0, as claimed. 

□ 

With the universal denominator 0(1?) in hand, we can now state our 
algorithm. 

Algorithm 3.2. 

Input: A function f{xi, . . . ,Xn) in k{X)Q . 

Output: A rational function in the qr representing /. 

Step 1: Write / = /1//2 of a quotient of elements fi that are in 
/^[X='=^]q'; apply each of the subsequent steps to /i and /2 (to simplify 
the notation we just assume from now on that / G k[X^^]Q). 

Step 2: Find an element F € k[zij] such that 4>{F) = f. 

Step 3: Set A = X^geSjv 9i^^)i write the numerator and denominator 
of 

4>{A) 
2nl4>{D) 

as polynomials in the Qr and output the result. 
We comment on each step in turn. 

Step 1: To express an invariant degree rational function / as a quotient 
of invariant polynomials, recall that / is a quotient of two polynomials of 
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equal degree, say of degree d. Dividing top and bottom by xf, we can write 
/ in the form /i = a/b, where a and b are in i^[X^-'^]o (but a and b may not 
be G-invariant). Since / is G-invariant, 

f = J-V (f) = —S^^^ = ll 
2n\^^^^' 2n\^g(b) /a 

where the numerator and denominator /j are G-invariant polynomials in 
/c[X='=^]q'. It suffices to express /i and /2 as rational functions in the qr-, and 
we can therefore assume that / is in h\X^^\^ from now on. 
Step 2: To lift / to an element F € k[zij], write f{xi, . . . /c-linear 
combination of degree-0 Laurent monomials . . . xj^" , with ai + • • • + a„ = 
0. Any such monomial is a product of a finite number of terms of the form 
Write all of the monomials in / in this form and replace each ^ by Zij 

to obtain the desired F G k[zij]. 

Step 3: By Lemma l3. II the only non-zero terms in the sum 

ct>{A) = cP{g{Dmg{F)) 

correspond to g £ G. Thus 

cPiA) = ^ cPig{Dmg{F)) = ^ c^{D)g{cj){F)) = 2n\ (^(D) f , 
geG geG 

as claimed. 

The only remaining thing that needs to be done is to explain how to 
write (j){A) and (p{D) as polynomials in the qr- Clearly ^ is a symmetric 
polynomial in the Zij, where 1 < i,j < n. It is therefore a polynomial in 
the elementary symmetric functions Si, 1 < i < N, and a polynomial in 
the power sums Pr = z^j by Newton's formulas. Since (j){pr) = qr — n it 
follows that (p{A) can be written as a polynomial in the qr, as desired. 

From the formula 

2n\<i,{D) = mm = mm = ^[Y ai^) ) 

geG geSpf \geSN J 

it follows that, once D is symmetrized over Sat, the same procedure can be 
used to express 2n\(j){D) in terms of the observables. 

Remark 3.3. The above procedure can be modified to produce polynomials 
in the qr with r < N/2. Indeed, recall that 4){sN^i) = c^v-j = Cj = 4>{si). 
Thus the image under of a polynomial i-*(si, • • • ,SAr) in the elementary 
symmetric functions is unchanged if Sj is replaced by s^-i for i > N/2. 

Note also that if one only wants to express /(xi, . . . rational 
function in qr for 1 < r < A^, rather than 1 < r < N/2, then the algo- 
rithm of IStil Proof of Proposition 1.1.2] can be used to write A and the 
symmetrization of D directly as polynomials in the power sums pi without 
going through elementary symmetric polynomials. 
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4. Initial terms and SAGBI bases 

It is natural to try to apply Grobner bases to our problem. Although 
Grobner bases are usually applied to rings, they can be adapted to solve 
problems in function fields. This was first noticed in detail by Sweedler |Swj . 
and has since been extended in several theoretical and practical ways; see, 
e.g., |M]- In our context we would have a G-invariant polynomial f{X) in 
Xi/xj that, by Theorem 12. H lies in the field k{qk) C k{X)Q. By introducing 
extra variables and calculating the Grobner basis of a suitably chosen ideal, 
an explicit expression can be found for / as a rational function in the qi. 

Unfortunately, all of our implementations of this idea suggest that it has 
the same trouble as implementations of the constructive algorithm given in 
the previous section: they are far too slow. The purpose of this section is 
to introduce a faster algorithm, for n < 4, using a variant of Grobner bases 
called SAGBI bases. 

We shall always assume that / G ; the general case reduces to this 

one (cf. Algorithm 13.21 Step 1). 

The subduction algorithm. Given an element 

...,Xn) = Y^ Caa:" e k[X^^] = k[xf^, . . . , x^^] , 

where a = (ai, . . . , o„) G Z", = x'^^ . . . x"" and Ca G k. We will write 
in{p) for the initial exponent p, i.e., the lexicographically largest exponent 
a such that Ca ^ 0. If i? is a subalgebra of then {in{p) \ p G R} is 

clearly a subsemigroup of Z*^; this semigroup is usually denoted by in{R). 
We are interested in the case where R = klX^^]^; in this case in{R) consists 
of elements a = (ai, . . . , a„) G Z" satisfying the following conditions: 

(i) ai H h a„ = 0, 

(6) (ii) ai > • • • > a„, and 

(iii) (ai, . . . , o„) ^ {-an, -ai). 

Here and in the sequel, >- denotes the lexicographic order on Z". 

Proposition 4.1. Suppose B is subset of R = k[X^^]Q chosen so that the 
elements in(6) generate in{R) as a semigroup, as b ranges over B. Then 
R = k[B]. 

Our proof below is based on the subduction algorithm of Robbiano- 
Sweedler |E,Sj and Kapur-Madlener |KMj for expressing a given element 
a G i? as a polynomial in elements of B. 

Proof. We want to write a G i? as a polynomial in elements of B. If 
a = 0, we are done. Otherwise write in(a) = eiin(6i) + ••• + erin{br), 
where bi, . . . ,br € B and ei, . . . , are non-negative integers. Then a and 
bi^... ft^"" have the same leading exponent; thus for some c (z k, 

a\ = a — df^ . . .b%^ 
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has a lexicographically smaller leading monomial than a. If a\ = 0, we are 
done. If not, we can replace a by a\ and apply the same procedure. That 
is, after subtracting a monomial in elements of B from ai, we obtain 02 € ii 
with a smaller initial exponent, etc. 

In order to complete the proof of the proposition, it is enough to show that 
the resulting sequence a = ao,ai,a2, ... in i? will terminate, i.e., Ur = 
for some r > 0. This is a very special case of Re, Proposition 6.5]; for the 
sake of completeness we give a direct proof below. 

By our construction in(ao) >~ in(ai) >- in(a2) y ■■■■ Thus it suffices 
to prove that for any given a = (ai, . . . , a„) G in(i?) there are only finitely 
many a' = {a[, . . . , a'^) G in(i2) such that a ^ a' . Indeed, if a ^ a' then < 
a'l < ai- Now condition (iii) says that > —a'^ > — ai, and condition (ii) 
says that — ai < a'^ < a'- < a[ < ai. Thus may assume only finitely many 
values for every i = 1, . . . ,n. This completes the proof of Proposition 14.11 

□ 

For computational purposes, we are interested in those cases, where the 
set B in Proposition 14.11 can be chosen to be finite, i.e., in(i?) is a finitely 
generated semigroup. In such cases we shall refer to i? as a SAGBI basis 
of G; cf. |E,e| Introduction]. (Here SAGBI stands for "subalgebra analog to 
Grobner bases for ideals"; this term is due to Robbiano and Sweedler |RSj .) 

Unfortunately, by |Rel Theorem 1.6] ^[X^^];^ has a SAGBI basis only for 
n = 2, 3 and 4; see also |Re[ Example 7.3]. Moreover, the situation cannot 
be remedied by replacing the lexicographic order with a different term order. 
On the other hand, for n < 4 the subduction algorithm is much faster than 
Algorithm 13.21 

Explicit SAGBI bases. Let q be the ith elementary symmetric polyno- 
mial in Xi/xj, as in Lemma 12.31 (Here i and j are distinct integers ranging 
from 1 to n.) Recall that ci = qi — n = ti[{X)ti:{X~^) — n. 

Lemma 4.2. The following elements form a SAGBI basis of k[X^^]Q . 
(a) d, ifn = 2. 

(h) ci and C2, ifn = 3. 

(c) ci,C2,C3 and p, if n = 4. Here p = S2{X)s2{X~^), where 

S2{X) = X1X2 + X1X3 H h X3X4 

is the second symmetric polynomial in X = (xi, . . . , 3:4). 

Proof. Let S = in{k[X^^]Q) be the subsemigroup of given by ©. 

(a) If n = 2 then S is clearly generated by (1, —1) = in(ci) as a semigroup. 

(b) For n = 3, S" is generated, as a semigroup, by the elements Ai = 
(1, 0, —1) = in(ci) and A2 = (2, —1, —1) = in(c2). Indeed, every element of 
/i € S" is of the form /x = (a, — c, —6), where 6 > c > and a = b + c. Thus 
// = cAi + (6 — c)A2 lies in the semigroup generated by Ai and A2. 
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(c) We want to show that any fi = (a, b, c,d) E 5 can be written as a 
non-negative integer hnear combination of 

Ai = (1,0,0,-1) = in(ci), 

A2 = (2,0,-l,-l)=in(c2), 

A3 = (3, -1, -1, -1) = in(c3) and 

A4 = (1,1,-1,-1) =in(p). 

If 6 < then the desired hnear combination is given by 

fi = {c- d)Xi + {b- c)A2 + (-6)A3 . 

If 5 > then, after replacing fihy — 6A4, we can assume 6 = and apply 
the above formula. □ 

Corollary 4.3. (a) If n = 2 then k[X^'^]^ = k[ci] = k[qi]. 

(b) Ifn = 3 then k[X^^^ = k[ci, C2] = k[qi, 92]- 

(c) Ifn = A, k[X^^]^ = /c[ci,C2,C3,p] = k[qi,q2,qz,v\- 

Proof. Immediate from Lemma l4.2| Prop osit ion 14 . 1 1 and Newton's formulas; 
cf. Lemma ESfa). □ 

Example 4.4. Let n = 3. Corollary 14.31 tells us that ci and C2 form a 
SAGBI basis for k[X^^\Q . For instance, applying the subduction algorithm, 
we obtain: 

E2{X,X)=2{c\ + ci-C2). 

Similarly, 

E2{X, X^) = 2c? + 5c? - 5ciC2 + 9ci - 12c2 + 18 . 
These identities will be used in Sectional 

Example 4.5. n = 4. Using the subduction algorithm of Proposition 14.11 
to express and C4 in terms of the SAGBI basis ci,C2,C3,p, we obtain the 
following relations in the ring ^[X^Jq': 

C2 = 2c\p — lQc\ — 8ciC2 — cip^ + Ibcip — 48ci + 

3c2P - 12c2 + C3P - 2p2 + 18p - 36 
C4 = 3cf + C1C2 - 3cip + 17ci + C2 - 3c3 - lOp + 21 . 



Eliminating and solving for p gives 

6 + 7ci + 7cf + 3cf — 10c2 — 5ciC2 + c\c2 — c\ — 6C3 — 3ciC3 — 2c4 — C1C4 
^ 2 + ci + cf - 3c2 - C3 

This shows that p G A;(ci, C2, C3, C4) or, equivalently, p S k{qi, . . . , 54). Thus 
by Corollarv 14.31 kiX^^ = k{qi,q2,qz:Qi)- Recall that Theorem 12.11 asserts 
only that k{X)Q = k{qi, . . . , q^); we have thus shown that the last two of 
these generators are not needed. 
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To obtain explicit expressions for C5 and cq as rational functions in ci, . . . , C4, 
we use the subduction algorithm once again: 

C5 = cf - 6ci - 5ciC2 + 7cip - 38ci + C2P - 7c2 + 6C3 - 2^^ + 20p - 42 
C6 = -2ci + cfp + 6ciC2 - 5cip + 27ci - 2c2P + 9c2 - 7c3 + 
2p2 - l8p + 34 , 

then substitute the above formula for p. 

Note also that for n = 3, Theorem 12.11 savs that A;(X)q' = k{qi,q2,qs), 
but 53 is not needed by Corollary I4.3r b) . We do not know whether or not 
any of the generators listed in Theorem 12. II can be left out for n > 5. 

5. More sets of variables 

The purpose of this section is to generalize Theorem l2.1l to the multi-array 
case. That is, instead of considering a single array of independent variables 
X = {xi, . . . , Xn), we shall consider m arrays: 

Xl = (Xii, . . . ,Xin) , 
X2 = {X21, ■ ■ ■ ,X2n) , 

Xm — {Xml: • • • 5 ^mn) • 

We shall view such n-tuples as diagonal n x n-matrices and operate with 
them as we did in Section |2l For example, 

XiXj — (xjiXji, . . . , XinXjn) , tr(^j) — Xn -|- ■ • • + Xin , 

the observables © can be written as 

(7) qru...,r^iXi, ...,X^)= tr(X[^ . . . X:-MX{^^ . . . X'^-) 
and the functions Em that arise in the phase transition problem Q as 

(8) =tr(Xi)...tr(X„)tr(Xfi...X„i) + 

tr(Xfi)...tr(X„i)tr(Xi...X„). 

Let k{Xi, . . . , Xm)o be the subfield of k{xu,xi2, ■ ■ ■ , Xmn) whose elements 
are rational functions in Xij homogeneous of degree in each n-tuple of 
variables X^. In other words, A;(Xi, . . . , Xm)o is the function field of the 
variety (F"-^)"'. In Section El we studied the action of the group G = 
S„ X T on P"~^; this action extends to an action of G x • • • x G = 
on (P""^)™'. We shall be interested in the invariants for the action of the 
diagonal subgroup of which we shall also denote by G. In concrete 
terms, the symmetric group S„ acts on the function field k{Xi^ . . . , Xm)o of 
^pn-i^m giixiultaneously permuting the variables xn, . . . ,Xin for each i. 
The 2-element group T = {1, r} acts on k{Xi, . . . , Xm)o by 

(9) T : Xij — > — for every z = 1, . . . , m and j = 1, . . . ,n. 

Xij 
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These two actions commute and thus induce an action of G = S„ x T on 
k{Xi, . . . , Xm)o- 

Theorem 5.1. The field k{Xi, . . . ,Xm)Q is generated (over k) by the fol- 
lowing elements: 

:= tr(Xi . . . Xra)tT {Xi... 

and 

g,,o,...,o := tr(X[)tr(Xf , 
qo,r,...,o := tT{X^2MX^n , 

qo,...,o,r ■■= tr(x;;^)tr(x-'') , 

where r ranges from 1 to (n{n — l))/2. 

We will give a proof that generalizes the proof in Section [3 and then 
discuss the prospects for a more constructive proof. Note that our earlier 
results for m = 1 will be used in the proof. 

We begin by disposing of the case n = 2. This case is anomalous in that 
G does not act effectively on k{Xi, . . . ,Xm)o] the kernel of this action is 
the 2-element subgroup of G = S2 x T generated by (o", r), where a is the 
non-trivial element of S2 and r is the non-trivial element of T. Thus for 
n = 2 

k{Xi, . . . , Xm)o = k{Xi, . . . , Xm)o ; 
here r acts via the involution Q. Setting 

f. — -J± 

Xi2 

we see that k{Xi, . . . , Xm)o = k{ti, . . . , tm) and r acts on this field by taking 
ti to 7- for each i = 1, . . . , m. It is now a simple exercise in Galois theory to 
show that in this case 

k{tl, . . . , tm) = k(ti -\- — , . . . , tffi -\- - , ti . . . tm + 7 7 ) • 

tl tm El . . . tm 

Since U + = tr(Xj)tr(Xri) - 2 and 

ti...tm+ ^ = tr(Xi . . . X„)tr (Xi . . . Xm)-^ - 2 , 

tl . . . tm 

this proves Theorem 15. II for n = 2. 

From now on we will assume that n > 3. Let K be the field generated 
over k by the observables listed in the theorem. For h = 1, . . . ,n let 



(10) fh{t) = n(* 



By Lemma f2.3l the coefficients of each lie in K. The field k{Xi, . . . , Xm)o 
is clearly the splitting field of the product 

fit) ■.= fl{t)...fm{t) 
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over K. The Galois group of / contains G and is naturally embedded in the 
product G™" of the Galois groups of the fi. 

Lemma 5.2. Assume n >3. An element g = (gi, . . . , g^) of fixes 

= tr(Xi . . . X^)tr (Xi . . . X^r^ G k{X^, Xm)^ 
if and only if g^ = ■ ■ ■ = g^. 

Proof. One direction is obvious: {gi, . . . , gi) fixes for every gi G G. For 

the purpose of proving the converse, we may replace g by (gjf^, . . . ,gi^)g 
and thus assume gi = id. In other words, we want to prove that ii g = 
{id,g2, ■■■,gn) fixes then g2 = ... gm = id in G. 

For i = 2, ... ,m, let gi = {ai, €{) G G = S„ x T. Here T = {1, r} is written 
multiplicatively, i.e., r is written as —1 and each = ±1. In particular, 

(1,52, •••,5m) • ( ) = ( ) " ■ 

Xhj ^ha^ij) 

Writing out explicitly in terms of the Xij (cf. ©), we obtain 

. . . G K , 

^-gLj "^^j "^"^j •^mj 

and 



91,.. .,1 - "Ti 



Comparing the terms of the last two equations, we see that for each h = 
2, . . . , m, either (i) e/^ = 1 and (T/j(i) = i for every z = 1, . . . , n, i.e., = id, 
or (ii) eh = —1 and (Th{i) = j for every pair of distinct integers i,j between 
1 and n. In case (ii), (T{i) assumes every value between 1 and n other than 
i, which is impossible for n > 3. We conclude that g2 = ■ ■ ■ = gn, as 
claimed. □ 

This shows that G is the Galois group of k(Xi, • • • , Xm)o over K. Exam- 
ining the tower 

k{Xi, . . . , Xm)o 
k{Xi, . . . , Xm)Q 
K 

we conclude that k{Xi, . . . ,Xm)o = K, thus completing the proof of the 
theorem. 

We now remark that Algorithm 13.21 (for m = 1) can be extended to 
the multi-array case by means of a suitable universal denominator. The 
generalization is not entirely straightforward because of the "new" generator 

= tr(Xi . . . Xm)tv (Xi . . . Xm)-^ 

that connects the different Xi. 
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(11) ct>{{ai,...,a^)E) 



Let k[Xf'^,...,X^]Q be the fc-linear combinations of Laurent mono- 
mials x"" . . . x'^^^ such that an + ■ ■ ■ + Um = for every i = 1, . . . ,m. 
Note that k[X^^ , X^^]o is generated, /c-algebra, by elements of the 
form -p-; in particular, the field k(Xi, . . . ,Xm)o is the field of fractions of 
k[X^^, X±i]o. Note also that \ . . . , X^^] is a G-invariant subring 
of k{Xi, . . .,Xm)o- 

Let Zhij be a set of mN algebraically independent variables over k, where 
N = n{n — 1), i and j are distinct integers between 1 and n, and h ranges 
from 1 to m. Let t be another independent variable and let 

k[zhij,t]^k[Xt\...,X^% 

be the surjective /c-algebra homomorphism given by 

H^hij) = — and (f){t) = g'l,...,! - n. 

Xhj 

We note that acts on k[zfiij,t] and where the h-th. component permutes 
the Zfiij and fixes t. 

Our universal denominator is a polynomial E € k[zfiij,t] that has the 
property that for cji , . . . , am G Stv 

U{E) if cji = ••• = a„ gG, 
I in all other cases. 

Here, as before, we view G as a subgroup of Sat for n > 3. 
The polynomial E is defined by 

E{zhij,t) := D{zuj) D{zmij) Ei{zhij,t) , 

where D is the polynomial defined in Section |21 and 

Eiizhij,t) := - ^ zujZ2g2(ij) ■ ■ ■ Zmg^(ij)) ; 

the product is taken over all g2, ■ ■ ■ ,9m & G such that at least one gi ^ 1, 
and the sum is taken over all pairs of distinct integers i and j between 1 and 
n. 

As in the m = 1 case one can verify that E satisfies (|11|) . The algorithm 
for expressing invariants in terms of observables is now similar to the m = 1 
case, and we leave the details to the reader. 

6. Reduction to one set of variables 

We now return to the problem of writing a G-invariant multihomogeneous 
rational function /(xn, a;i2, ■ • • , Xmn) of total degree in each group of vari- 
ables Xi = (xii, . . . , xin), • • • , Xm = {xmii ■ ■ ■ , Xmn), as a rational function 
in the "observables" qri,...,rm- 

In principle, the algorithm sketched in the preceding section is a solution 
to this problem. As one might expect, it is too slow to be of practical 
significance. On the other hand, the approach we took in Section ^ cannot 
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be extended to m > 2, even for small n, because suitable (finite) SAGBI 
bases do not exist; see |R,e| Theorem 1.6 and Example 7.3]. This motivated 
our search for an algorithm that would reduce computations in the multi- 
array case (m > 2) to computations in the single-array case (m = 1). In this 
section we discuss such an algorithm and use it to generate explicit formulas 
for E2iX, Y) for n = 3 and 4. 

Another generating set of observables. We begin by proving another 
variant of Theorem ll.3r a) . 

Theorem 6.1. The field k{Xi, . . . ,Xm)o is generated (over k) by the ele- 
ments 

Qr := qr{Xi) = gr-,o,...,o := tr(X[)tr(Xf '') 

and 

q?^ ■■= qs,i,o,...,o := ti{X-lX2MX^' X^') , 

qi""^ ■■= qs,o...,o,i := tr(XfX„)tr(Xf ^X'l) . 
where r = 1, . . . , "^■'^^ and s = 0, . . . , n{n — 1) — 1. 

Informally speaking, the element of Theorem 15.11 ties Xi, . . . ,Xm 

together, where as the elements q)- of Theorem 16. II relate X-i to Xi for each 
i = 2, . . . , m. 

Proof. Let K = k{Xi^ . . . , Xm)o = k{xij/xii), and let be a vector space 
of dimension = n{n — 1) over K. We shall write elements of as (zij), 
where € A, i.e., i and j are distinct integers between 1 and n, as in ©. 
The natural action of G = S„ x T on A induces a permutation action on 

Given an n-tuple A = (oi, . . . ,a„), we will denote the A^-tuple of ratios 
^ by p{A). Then p: — > is a G-equivariant map. Finally, for 
h = 1, . . . ,m let Zh he the A'-tuple of variables (zhij), where € A. 

We want to show that any f{Xi, . . . , Xm) € k{Xi, . . . , X^)^ can be writ- 
ten as a rational function in the observables listed in the statement of the the- 
orem. We begin with several reductions. First of all, we may assume without 
loss of generality that /(Ai, . . . , Xm) G klxf^, . . . , X^]^ , by writing an in- 
variant rational function as a quotient of invariant polynomials. Secondly, 
f{Xi, . . . , Xm) can be lifted to a G-invariant polynomial F{Zi, . . . , Zm) in 
Zfiij such that 

f{Xi,...,Xm)=F{p{Xi),...,p{Xm)). 

Thirdly, we may assume without loss of generality that F{Zi,... ,Zm) is 
a homogeneous polynomial in the arrays of variables Zi, . . . , Z^a of multi- 
degree (di, . . . , dm). Indeed, any G-invariant F can be written as a sum of 
G- invariant multihomogeneous components, say, F = F\^ ■ ■ ■ ^ F^^ and we 
may replace / by /i = Fi(p(Xi), . . . , p{Xm))- 
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Multilinearizing F, we obtain a G-invariant multilinear polynomial M in 
d = di + ■ ■ ■ + dm A^-variable arrays such that 

(12) f{Xi, ...,Xm)= M{p{Xi), p{Xi), piXm), p{Xra)) . 

^ , ' , ' 

di times d^n times 

Next we observe that by the Vandermonde argument 

(13) /,(/), p(Xi),...,p(Xf-i) 

form a i^-basis of K'^; here / stands for the identity n-tuple (1, . . . , 1). In 
particular, for every 2 < i < m, we can write 

(14) p{Xi) = Xiop{I) + \^lp{Xl) + ■■■ + Ai,jv-ip(Xf -1) . 

for some Ajo, • • • , Kn-i £ . Substituting this into (fT^ and expanding, 
we see that f{Xi, . . . , X^) can be written as a sum of terms of the form 

(monomial in Xij) M{p{X{^), pixi"")). 

(In fact, zi = • • • = irfj = 1 in each term, but we shall not use this in the 
sequel.) Since M is G-invariant, each 

M{piXn...,p{Xi-)) 

is an element of A;[Xi]q', and thus, by Theorem 12.11 it can be written as a 
rational function in the observables Qr = qr{Xi). 

Thus it remains to show that each Xij lies in the field L generated by the 
elements listed in the statement of the theorem. Note that by Theorem 12.11 
the observables Qr lie in L for every r (and not just for r = 1, ... N/2). 
Taking the dot product of both sides of ^ with p{I),p{Xi), p{X^'^), 
and remembering that 

p{Xl) ■ p{X{) = qi+, - n and p{X,) ■ p{Xi) = qf - n, 
we obtain the following linear system: 

N 

(15) '^{qi^j - n)Xi = q^^^ - n, where i = 0, 1, . . . , A'' - 1. 

i=l 

Since p{I), p{Xi), . . . , p{X^^^) form a basis of , the matrix 

/qo-n qi-n ... q^-i - n\ 
I^IQ^ Q ^ Qi-n q2-n ... qN-i - n 

\qN-i-n qN-n ... q2N-2-nJ 

of this system is non-singular. Solving the linear system (|15|). we conclude 
that each Xj lies in L, as claimed. □ 
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Multilinear invariants. Our proof of Theorem 16.11 reduces the computa- 
tion of a given element / of k{Xi, . . . , Xm)o to computing finitely many 
elements in A;[Xi]g^. Note that this reduction is constructive. The result- 
ing algorithm is cumbersome in general; however, it simplifies considerably 
in the case where the polynomial F{Zi, . . . , Zm) is itself multilinear, i.e., 
di = ■ ■ ■ = dm = 1 and M = F. This is exactly what happens in that case 
of greatest interest to us, namely, / = Em (cf. ©); here 



Proposition 6.2. 

Em{Xl. 

where 



E 



ilvjm,i = l 



E 

i2,.--i*m = 



A2n 



«2 
1 ' 



/ ho \ 

Ail 



.1" 



n \ 

n 



\1n-i 



n J 



and Q is the N x N -matrix H16|) . 

Proof. The first formula is obtained by substituting (|14l) into 

Em{Xi, ...,Xm)= F{p{X,), . . .,p{Xm)) 

and expanding the right hand side. (Note that the specific form of F is not 
used here; we only use the fact that F is multilinear.) The second formula 
comes from solving the linear system p5|) . □ 

Remark 6.3. Proposition 16.21 remains valid if Em is replaced by any / 
in k[X^\ . . . ,X±i]Jf such that /(Xi, ...,Xm) = F{p{Xi), . . . , p{Xm)) for 
some G-invariant multilinear polynomial F(Zi,...,Zm) in m A^- variable 
arrays Zi,... ,Zm- 

Explicit determination of the triplet phase invariant. For m = 2 the 

formula of Proposition 16.21 can be rewritten in the matrix form: 



(17) 



E2{X,Y) = (eo,...,ejv-i)Q" 



Qo,i 



\ 



\qN-i,i - nj 



where Cj = ei{X) = E2{X, X^). Here, as usual, we write X for Xi and Y 
for X2; cf. ((2)). We have thus reduced the computation of E2{X,Y) to the 
computation of Cj = E{X,X^) for i = 0, . . . , — 1. Note that cq = 2ngi, 
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SO there are only — 1 elements ei, . . . , cn-i we need to compute. We can 
further reduce this number by using the basis 

p(X-^/'),p{X-^/'+'),...,p{X^/'-') 

of -fC"" instead of ()13() . This has the effect of shifting the subscripts in ()17() 
as follows: 



Proposition 6.4. 



>-i 



Q-N/2,1 -n \ 

Q-N/2+1,1 - n 



E2{X,Y) = (e_7v/2, ei_7v/2) • • • ,eisf/2-l) ■ R 

\ Qn/2-1,1 - n J 
E2{X,X'^), and R is the N x N -matrix 



where qr = qr,o, 



R 



-N+i - n 



\ 9-1 



n 



-N+2 - n 
qo-n 



q-i - n 
qo-n 



\ 



qN-2 -nj 



□ 



Keeping in mind the identities Cj = e_i_j and cq = 2ngi, we see that 
the formula of Proposition IH.4I reduces the computation of E2{X,Y) to the 
computation of = E2{X, X^) for i = 1, . . . , {N/2) — 1. We also note that 
since q-r = qr, the matrix R involves qr only for r = 1, . . . , A^, as opposed 
to r = 1, . . . , 2iV - 2 for the matrix (fTHjl . 

The calculation of the Cj involves only one set of variables. For n = 3 we 
have the following explicit results. 

Example 6.5. Let n = 3. Then = 6, qo = 9, and Proposition 16.41 gives 
E2{X,Y) = {e2,ei,6qi,6qi,ei,e2) R~^ 



Q-2,1 - 3 

V 92,1 - 3 y 



where R is the 6 x 6-matrix 



R 





- 3 


95 - 


3 


94 - 


3 


93 - 


3 


92 - 


3 


91 - 


3\ 


q5 


- 3 


94 - 


3 


93 - 


3 


92 - 


3 


91 - 


3 


6 




qi 


- 3 


qs - 


3 


92 - 


3 


91 - 


3 


6 




91 - 


3 


q3 


- 3 


q2 - 


3 


91 - 


3 


6 




91 - 


3 


92 - 


3 


q2 


- 3 


qi - 


3 


6 




91 - 


3 


92 - 


3 


93 - 


3 


\qi 


- 3 


6 




91 - 


3 


92 - 


3 


93 - 


3 


94 - 


y 
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Recall that ei and 62 were computed in Example 14.41 bv using SAGBI basis 
techniques: 



ei = 2ci + 2ci - 2c2 

62 = 2cf + 5cl - 5ciC2 + 9ci - 12c2 + 18 , 



These can be easily expressed in terms of qi and q2 by using Newton's 
identities, which here amount to 



ci = gi - 3 and C2 = {{qi - 3)^ - {q2 - 3))/2. 



The explicit formula for E2 that results is quite different from, and consid- 
erably more elaborate than, the formula given in Example 11.21 



Example 6.6. Let n = 4. Then N = 12, 
Proposition 16.41 reduces to 



16, and the formula of 



E2{X,Y) = (65,64,63,62,61,891,8^1,61,62,63,64,65) -R 



-1 



q-2,1 - 4 

V ^5,1 - 4 / 



where R is the 12 x 12-matrix 



R 



/912 - 4 qii - 4 
qii - 4 910-4 
910-4 99-4 



V91-4 



12 



93 - 4 92 - 4 91 - 4 \ 

92 - 4 91 - 4 12 

91-4 12 91-4 

99 - 4 910 - 4 qii-A) 



The elements 61 , . . . , 65 can again be explicitly determined using the SAGBI 
basis subduction algorithm; the result is: 
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ei = 2cj + 8ci - 2c2 - 4p + 20 

62 = 2c? + 4c?-5ciC2-cip + 7ci -5c2 + 3c3 + 2p + 2 

63 = 2cf + 4c? - 7c? C2 + 3c?p - 41c? - 31ciC2 

+7ciC3 - 2cip^ + 40cip - 154ci + 8c2P - 28c2 
+2c3P + 24c3 - + 72p - 136 

64 = 2cf + 4cf - 9c? C2 + 13c?p - 131c? - 80c? C2 + 9c? C3 

-7c?/ + 144c?p - 651c? + 24ciC2P - 186ciC2 + 7ciC3p 
+43ciC3 - 29ci/ + 379cip - 1136ci - 5C2C3 + 21c2p 
-120c2 + 3c3p + 78c3 - Mp^ + 334p - 688 

65 = 2c? + 4c? - llcfc2 + 27c?p - 243c? - 142c? C2 

+llc?C3 - 14c?p2 + 289c?p - 1317c? + 46c? C2P 

-429c? C2 + 14c? C3P + 53c?C3 - 66c?p2 + 892c?p 

—2830c? — I7C1C2C3 + 8IC1C2P — 471ciC2 + 19ciC3p 

+135ciC3 + 2ci/ - 137ci/ + 1286cip - 3039ci - 24 

-35c2C3 + 27c2P - 167c2 + 3c| - 2c3p^ - csp 

+I7IC3 + 8p^ - 150p2 + 856p - 1402 

Then we eliminate p, by using the formula in Example 14.51 to express 
61, ... ,65 as rational functions in ci, C2 and C3. Finally, we use Newton's 
identities to rewrite each Cj as a rational function of qi, q2 and ^3. 

Remark 6.7. The SAGBI basis algorithms were implemented explicitly in 
magma jBCPj . and are very efficient, though of course they are limited to 
n < 4. 

7. Regular invariants 

Theorems 15.11 and 16.11 tell us that if we allow ri , . . . , to range over the 
integers then the observables (i'ri,...,r„i(^i5 • • • , Xm) generate k{Xi, . . . , Xjn)^ 
as a field extension of k. H. Hauptman asked us whether the functions Em 
in (jSJ that arise in the phase determination problem @ can be written 
as polynomials, rather than rational functions in the qri,...,rm- More gener- 
ally, we can ask whether the the observables q'ri,...,r„i generate the /c-algebra 
k[X^^, . . . , Xm^]Q . In this section, we will show that the answer is generally 
"no." 

Proposition 7.1. Assume n > 4. 

(a) The function 

fix) = E2{X, X) = tr2(X)tr(X-2) + tr^{X-^)tTiX^) 

cannot be written as a polynomial in the observables qr, as r ranges over the 
integers. 
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(b) Suppose m >2. Then function Em{Xi, . . . ,Xm) given by cannot 
be written as a polynomial in the observables gri,...,r„; as ri,...,rm range 
over the integers. 

(c) For any m > 1 the k-algebra k[Xf'^,...,X^]Q is not generated by 
the observables qi^^...^i^, as ii, . . . ,im range over the integers. 

By Corollary 14. ^f a) and (b), / can be written as a polynomial in the 
observables if n = 2 or 3; i.e., part (a) is not true for n = 2 or 3. Also, part 
(b) is not true for m = 1 (indeed, Ei is itself an observable). 

Curiously, as we saw in the Sectional the observables qi^^...^i^ come close 
to generating the fc-algebra k[Xf'^., . . . ,X^]q in the following sense: every 
a G k[Xf''^, . . . , X^-'^Jq' can be written in the form 

for some [3 G k[gh,...,i,n]- Here (l){E) is a fixed nonzero element of k[qi-^^,,,^i^]., 
independent of a. 

Proof. Part (b) is easily deduced from part (a) by specializing Xi and X2 
to a single array of indeterminates X = (xi, . . . , and X^, . . . , Xm to the 
"identity array" I = (1, . . . , 1). Part (c) is a consequence of parts (a) (for 
m = 1) and (b) (for m > 2), since / G k[X]^ and Em G k[X^^ , . . .,X^^]^ 
for every m >2. 

Thus we only need to prove (a). Let 

p: A''-{xi...x„ = 0} — >A^ 

be the map given by 

(18) /j(a;i,...,x„) = ( — ,—,..., ^^^). 

To prove part (a), we claim that it suffices to exhibit n-tuples x and y with 
nonzero coordinates such that 

(i) fix) + f{y) and (ii) p{x) ~ p{y), 

where two A^-tuples u and v are equivalent, written n ~ if one is a 
permutation of the other. 

Indeed, assume that (i) and (ii) are true. Since the observables qr are 
symmetric functions in {xi/xj}, where i,j = 1, . . . ,n, i ^ j, the fact that 
p{x) ~ p{y) implies that qr{x) = qr{y) for every integer r. Thus q{x) = q{y) 
for every q G k[qi,q2, ■ ■ ■]] given that f{x) 7^ /(?/), we immediately deduce 
that f ^ k[qi, q2, . . .] as desired. Note that it is sufficient if the coordinates 
of X and y lie in the algebraic closure of the field k. 

First consider the case n > 4. Let z be a primitive n-th root of unity. 
(Recall that we are assuming that char (k) = 0, so that z exists in the 
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algebraic closure of k.) We claim that the points 

satisfy conditions (i) and (ii). Here x is obtained from the point (1, z, z^, • • • , z^~^) 
by replacing z by 1, and by z^, and y is obtained from (1, z, z'^ , ■ ■ ■ , z"^^) 
by a similar alteration of two coordinates. Note that 

(19) tr(x^) + tr(y*) = 2(1* + z' + z^' + ■ ■ ■ + z^""!)^) = 

for any i ^ (mod n). In particular, tr(a;*) = — tr(y*) for i = ibl,ib2 and 
consequently, f{x) = —f{y). Thus, in order to check (i), we only need to 
verify that /(x) / 0. This is easily done; substituting 

tr(x*) = l-z'-z'^' + z^' = (1 - z'){l - z^^) 

into f{x) = tr^(x)tr(a;~^) + tr^(x^^)tr(x^), we see that 

f{x) = 2z-6(l - zf{l - z^f{l -z^)^0 

for any n > 4. 

Now we have to show that p{x) ^ p{y)- To do this let Gx{t) denote the 
formal generating function 

Gx{t)=2 + 2t^ + t^ + ---+ G ^t]/{e - 1) 

in which the coefficient of f is the number of times that appears as a 
coordinate in x. Then Gj.(t)G^(t~^) G Z[t]/(r - 1) is the formal generating 
function of p{x). Thus (ii) is equivalent to 

(20) Gx{t)Gx{t-^) = Gy{t)Gy{r^) in Z[t]/(r - 1). 

Using the factorization t"-l = (t-l)(r-^H hi), we get an isomorphism 

Z[i]/(r - 1) ~ Z e i?, where R := Z[t]/(r-^ + • • • + 1). 

Here t maps to 1 in the first summand. We see that in order to prove (|2()|) 
it suffices to check that the two sides have the same images in Z and in R. 
The image in Z is obtained by evaluating at 1, and ^^^(l) = Gy{l) = n, so 
the desired identity is immediate in Z. If and gy denote the images of Gx 
and Gy in R, then a calculation similar to (|19|) show that gy{t) = —gx{t)- 

It follows immediately that gx{t)9x{t~^) = 9y{i)9y{i~^) desired. This 
finishes the proof for n > 4. 

For n = 4 we have to consider a more carefully crafted example. Let z be 
a primitive 13-th root of unity, and set 

(21) x={l,z,z\z^), y=(l,z,z^z"). 

An easy calculation, which we will leave to the reader, shows that f{x) = 
~f{y) 7^ 0) s-iid a generating function argument shows that p{x) ~ p{y)- 
(This choice of x and y is based on the fact that {0, 1, 4, 6} and {0, 1, 4, 11} 
are inequivalent planar difference sets for the cyclic group Z/13Z; cf. |Ry[ 
Chapter 9].) 

This completes the proof of Proposition 17.11 □ 
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8. Rationality 

In this section we investigate the structure of the field k{Xi, . . . ,Xm)Q, 
without specific reference to the observables. The main result, Proposi- 
tion |H3 below, was communicated to us by M. Lorenz. 

Recall that a field extension K/k is called rational (or equivalently, K is 
said to be rational over k) K = k(ti, . . . ,tr) for some elements ti, . . . ,tr G 
K, algebraically independent over k. The extension K/k is called stably 
rational if there exists a field L, containing K, which is rational over both K 
and k. In other words, for some finite collection of indeterminates si, . . . ,Sr 
the field L = K(si, . . . , Sr) is rational over k. 

Proposition 8.1. (a) For m = 1, k{X)Q is rational over k. 
(b) For any m>\, k{Xi, . . . ,Xm)o is stably rational over k. 

Proof, (a) First assume n = 2. Then k{X)'-' C k{X)Q = k{xi/x2), and 
the desired conclusion follows from Liiroth's theorem. For n > 3, part 
(a) is a special case of a theorem of N. Lemire; see ^ Theorem 7.7]. To 
see how Lemire's theorem applies, note that the elements Xi/xj (viewed 
additively, with the natural S„-action) form the root system An-i- The 
group G = S„ X T is the automorphism group of this root system, and 
k{X)Q = k(M), where M is the lattice (i.e., the multiplicative subgroup of 
/c(X)q) generated by {xi/xj \ i,j = 1, . . . ,n}. 

(b) We argue by induction on m. The base case, m = 1, is part (a). For 
the induction step, assume m > 2. Let Km = k{Xi, . . . , Xm)o and L = 
Km-i{xi, . . . , x„), where = (xi, . . . , By the induction assumption, 
^m-i is stably rational over k; thus it is enough to show that K^ is stably 
rational over K^_j^. We will do this by proving that L*^ is rational over 
both K^ and K^_^. 

Note that since L = Km{xi) = K^-iixi, . . . , Xn) and the G-action on 
Kjn-i is faithful, the desired conclusion would follow from the no-name 
lemma, if G-acted linearly (or, more precisely, semi-linearly) on the variables 
e.g., |EM1 Proposition 1.1] or |Shl Appendix 3]. However, the 
no-name lemma cannot be used directly in this setting because our G-action 
is not semi-linear (r acts by inversion!). 

Fortunately, our action can be easily linearized, following an argument of 
Hajja and Kang j HKl Lemma 2.3(i)]. Let yi = j^f^- Then permutes 

yi, . . . , y„ and r sends each y^ to —yi. Since Xi = we have 

L = Km{yi) = Km-i{yi, ... ,yn) ■ 
Now the no-name lemma tells us that L*^ is rational over both K^ and 
-^m-i' ^s claimed. □ 

Remark 8.2. Let H be the subgroup of index 2 in G = Sn x T consisting 
of pairs of the form (u, r*^'^'^'^). Hajja and Kang HKi Theorem 3.2] have 
shown that k{Xi, . . . , Xm)^ is rational over k. 
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